Classifying the socio-situational settings of transcripts of spoken discourses

Authors

  • Yangyang Shi
  • Pascal Wiggers
  • Catholijn M. Jonker
Abstract

In this paper, we investigate automatic classification of the socio-situational settings of transcripts of spoken discourse. Knowledge of the socio-situational setting can be used to search for content recorded in a particular setting or to select context-dependent models, for example in speech recognition. The subjective experiment we report on in this paper shows that people correctly classify 68% of the socio-situational settings. Based on the cues that participants mentioned in the experiment, we developed two types of automatic socio-situational setting classification methods: a static method using support vector machines (S3C-SVM), and a dynamic method applying dynamic Bayesian networks (S3C-DBN). Using these two methods, we developed classifiers applying various features and combinations of features. The S3C-SVM method with sentence length, function word ratio, single occurrence word ratio, part of speech (POS) and words as features results in a classification accuracy of almost 90%. A bigram S3C-DBN with POS tag and word features yields a dynamic classifier that obtains nearly 89% classification accuracy. The dynamic classifiers not only achieve results similar to those of the static classifiers, but can also track the socio-situational setting while processing a transcript or conversation. On discourses with a static socio-situational setting, the dynamic classifiers need only the initial 25% of the data to achieve a classification accuracy close to that achieved when all data of a transcript is used. © 2013 Elsevier B.V. All rights reserved.
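As a rough illustration of the surface features named in the abstract, the following sketch computes an average sentence length, a function word ratio, and a single occurrence word ratio for one transcript. It is only a sketch under stated assumptions: the function name `surface_features`, the tiny `FUNCTION_WORDS` list, whitespace tokenization, and the exact definition of each ratio are all hypothetical and are not taken from the paper.

```python
from collections import Counter

# Hypothetical, deliberately tiny function-word list (an assumption;
# the list used in the paper is not given in the abstract).
FUNCTION_WORDS = {"the", "a", "an", "of", "to", "in", "and", "or",
                  "but", "is", "it", "that", "i", "you"}


def surface_features(sentences):
    """Compute three transcript-level features from a list of sentence strings.

    Assumed definitions (not confirmed by the source):
      - avg_sentence_length: tokens per non-empty sentence
      - function_word_ratio: function-word tokens / all tokens
      - single_occurrence_ratio: word types seen exactly once / all word types
    """
    # Naive lowercase whitespace tokenization, skipping empty sentences.
    tokenized = [s.lower().split() for s in sentences if s.strip()]
    tokens = [t for sent in tokenized for t in sent]
    if not tokens:
        return {"avg_sentence_length": 0.0,
                "function_word_ratio": 0.0,
                "single_occurrence_ratio": 0.0}

    counts = Counter(tokens)
    n_function = sum(1 for t in tokens if t in FUNCTION_WORDS)
    n_singletons = sum(1 for c in counts.values() if c == 1)
    return {
        "avg_sentence_length": len(tokens) / len(tokenized),
        "function_word_ratio": n_function / len(tokens),
        "single_occurrence_ratio": n_singletons / len(counts),
    }


if __name__ == "__main__":
    print(surface_features(["the cat sat", "the dog"]))
```

A feature dictionary of this kind could be combined with POS and word counts and passed to any standard classifier; which representation and classifier settings the authors actually used is not stated in the abstract.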

Similar resources

Core Units of Spoken Grammar in Global ELT Textbooks

Materials evaluation studies have constantly demonstrated that there is no one fixed procedure for conducting textbook evaluation studies. Instead, the criteria must be selected according to the needs and objectives of the context in which evaluation takes place. The speaking skill as part of the communicative competence has been emphasized as an important objective in language teaching. The pr...

Death risk classifying in patients with internal medical emergencies in pre-hospital settings

Introduction: In a pre-hospital emergency, identifying high-risk medical patients and making appropriate decisions are very important. Classifying life-threatening risks in pre-hospital settings can improve the decision-making process. This study aimed to classify the risk level of death in patients in pre-hospital emergency settings. Materials and Methods: This study was a descript...

The Impact of Translation in Emerging Iran’s Political–Religious Intellectual Discourses and Socio-Cultural Changes (From Early Qajar Dynasty to the End of Reza Shah Era)

Wide-ranging sociological studies have been conducted on the history of Iranian intellectuality and modernism. The findings jointly acknowledge that, due to communication with the West and the effects received from modernism, the first generation of Iranian intellectualism emerged. It is said that having benefited from the translated works in its general concept, i.e. incl...

The Role of Disfluencies in Topic Classification of Human-Human Conversations

We investigate the impact of disfluencies on the task of classifying natural human-human conversations into topics. Disfluencies are distinctive to spoken language, and their effect on a number of spoken language understanding tasks, including spoken language classification, remains largely unknown. We use a subset of Switchboard-I annotated for disfluencies and topics, and investigate the effe...

Adaptive Language Modeling with a Set of Domain Dependent Models

An adaptive language modeling method is proposed in this paper. Instead of using one static model for all situations, it applies a set of specific models to dynamically adapt to the discourse. We present the general structure of the model and the training procedure. In our experiments, we instantiated the method with a set of domain dependent models which are trained according to different soci...

Journal:
  • Speech Communication

Volume 55

Publication date: 2013